Normalization of zero-inflated data: An empirical analysis of a new indicator family
نویسندگان
چکیده
Recently, two new indicators (Equalized Mean-based Normalized Proportion Cited, EMNPC, the Mean-based Normalized Proportion Cited, MNPC) were proposed which are intended for sparse data. We propose a third indicator (Mantel-Haenszel quotient, MHq) belonging to the same indicator family. The MHq is based on the MH analysis – an established method for polling the data from multiple 2×2 contingency tables based on different subgroups. We test (using citations and assessments by peers) if the three indicators can distinguish between different quality levels as defined on the basis of the assessments by peers (convergent validity). We find that the indicator MHq is able to distinguish between the quality levels in most cases while MNPC and
منابع مشابه
Normalization of zero-inflated data: An empirical analysis of a new indicator family and its use with altmetrics data
Recently, two new indicators (Equalized Mean-based Normalized Proportion Cited, EMNPC; Mean-based Normalized Proportion Cited, MNPC) were proposed which are intended for sparse scientometrics data. The indicators compare the proportion of mentioned papers (e.g. on Facebook) of a unit (e.g., a researcher or institution) with the proportion of mentioned papers in the corresponding fields and publ...
متن کاملField- and time-normalization of zero-inflated data: An empirical analysis using citation and Twitter data
Thelwall (2017a, 2017b) proposed a new family of fieldand time-normalized indicators, which is intended for sparse data. These indicators are based on units of analysis (e.g., institutions) rather than on the paper level. They compare the proportion of mentioned papers (e.g., on Twitter) of a unit with the proportion of mentioned papers in the corresponding fields and publication years (the exp...
متن کاملModeling the Number of Attacks in Multiple Sclerosis Patients Using Zero-Inflated Negative Binomial Model
Background and aims: Multiple sclerosis (MS) is an inflammatory disease of the central nervous system.The impact of the number of attacks on the disease is undeniable. The aim of this study was to analyze thenumber of attacks in these patients.Methods: In this descriptive-analytical study, the registered data of 1840 MS patients referred to the MS clinicof Ayatollah Kash...
متن کاملHurdle, Inflated Poisson and Inflated Negative Binomial Regression Models for Analysis of Count Data with Extra Zeros
In this paper, we propose Hurdle regression models for analysing count responses with extra zeros. A method of estimating maximum likelihood is used to estimate model parameters. The application of the proposed model is presented in insurance dataset. In this example, there are many numbers of claims equal to zero is considered that clarify the application of the model with a zero-inflat...
متن کاملA New Class of Zero-Inflated Logarithmic Series Distribution
Through this paper we suggest an alternative form of the modified zero-inflated logarithmic series distribution of Kumar and Riyaz (Statistica, 2013) and study some of its important aspects. The method of maximum likelihood is employed for estimating the parameters of the distribution and certain test procedures are considered for testing the significance of the additional parameter of the model. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1704.02211 شماره
صفحات -
تاریخ انتشار 2017